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ABSTRACT 

We report detections of three z ^ 2.5 submillimeter-selected galaxies (SMGs; SMM J14011+0252, 
SMM J14009+0252, SMM J04431+0210) in the lowest rotational transition of the carbon monoxide 
molecule (CO J = 1 - 0) and one nondetection (SMM J04433+0210). For the three galaxies we 
detected, we find a line-integrated brightness temperature ratio of the J = 3 — 2 and 1 — lines of 
0.68±0.08; the 1 — line is stronger than predicted by the frequent assumption of equal brightnesses in 
the two lines and by most single-component models. The observed ratio suggests that mass estimates 
for SMGs based on J = 3 — 2 observations and J = 1 — column density or mass conversion factors are 
low by a factor of 1.5. Comparison of the 1-0 line intensities with intensities of higher- J transitions 
indicates that single-component models for the interstellar media in SMGs are incomplete. The small 
dispersion in the ratio, along with published detections of CO lines with Jupper > 3 in most of the 
sources, indicates that the emission is from multi-component interstellar media with physical structures 
common to many classes of galaxies. This result tends to rule out the lowest scaling factors between 
CO luminosity and molecular gas mass, and further increases molecular mass estimates calibrated 
against observations of galaxies in the local universe. We also describe and demonstrate a statistically 
sound method for finding weak lines in broadband spectra that will find application in searches for 
molecular lines from sources at unknown redshifts. 

Subject headings: galaxies: high redshift — galaxies: ISM — galaxies: individual (SMM J04431-I-0210, 
SMM J04433-t-0210, SMM J14009-I-0252, SMM J14011-^0252) — techniques: spec- 
troscopic — methods: statistical 



1. INTRODUCTION 

Observations and models of the extra galactic far- 
infrar ed/submillime ter b ackground (e.g.. iPuget et al.l 
119961: iFixsen et all 119981 : iLagache et all l2003| ) indicate 
that a large fraction of cosmic star formation has taken 
place behind a veil of dust. At high redshift, this con- 
clusion is consistently affirmed by comparisons of ob- 
scured and unobscured star forma tion in optically se- 
lected galaxy populations fe.g.. iReddv et al]|2008f i. A 
more striking pattern emerges when one considers all 
galaxy populations, however: the systems forming stars 
at the highest rates (i.e., having t he highest bolometric 
luminosities) are also the dustiest (jAdelberger fc Steidell 
[2000). This trend reaches an extreme in the case of 
submillimeter galaxies (SMGs), first identified over a 
decade ago as bright (> 5 mJy) sources in 850 ^m 
surveys with the Submillimeter Common-user Bolome- 
ter Array (SCUBA) oi i the James Clerk Maxweh Tele- 
scope (JCMT) (e.g.. ISmail et al.l [19971 IBarger et al.l 
[l99l [H^gheTS^aT 199^~SMGs have faint X-ray 



counterparts ( Alexander et al.|[2005) and show the dis- 
turbed morphologies characteristic of major mergers 
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([Consehce et al.l [20031 cf. IDave et al.|[2010l ). suggesting 
they are important sites of mass assembly as well as 
star formation. However, their high obscuration has 
also proved challenging for detailed study, including de- 
termining their precise redshifts and masses, and un- 
derstanding their interstellar media and star formation 
processes. While photometric techniq ues shed some 
light on SMGs' redshift distribution ('e.g. JAretxaga et all 
120031) ■ obtaining more than a handful of spectroscopic 
redshifts for SMGs and their warmer analogs has re- 
quired a painstaking effort of radio c ontinuum map- 
ping followed by optical spectroscopy ([Chapman et al.l 
120031 12004 12005D , whose results have been securely con- 
firmed by the detec tion of CO emi s sion lines at mil- 
limeter wa velengths (Neri et al."2003"; 'Greve et al."2005| 
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Tacconi et al. 2006; Chapman ct al. 2008; Bothwcll et al 
2010). 

Redshift determination is the first step towards ad- 
dressing the crucial question of SMGs' mas ses. Trans- 
lation of SMGs' angular clustering strength ([Blain et al.l 
120041: IWeifi et a l. 2009b) to a linear correlation length 
and dark matter halo mass depends sensitively on their 
exact redshift distribution, although a large character- 
istic halo mass can be independently estimated from 
cosmological simulations assuming a proportionality be- 
tween dark matte r accretion and star formation rates 
([Genel et al.l 120081 ). The total dynamical masses of the 
galaxies themselves are also large, as first sug gested by 
meas urements of large CO line widths (e.g. . [Waver et al.l 
119981 [T999i) and confirmed by spatiall y resolved CO map- 
ping (e.g. [P owncs & Solomon 2003; Genzel et al.l 120031 : 
iTacconi et all |200& i2008t iBothweh et al.ll2010D . How- 
ever, given the challenges of obtaining such mapping 
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and the complications of high extinct i on fo r steUar mass 
determinations (e.g., iHainhne et al.l I2010D . estimating 
SMGs' molecular gas masses from their CO line luminosi- 
ties remains a useful way to place lower limits on their 
total masses. Accurate molecular gas masses are also re- 
quired to determine SMGs' gas mass fractions and star 
formation efficiencies, important inputs for understand- 
ing their evolutionar y status and the l ikely properties 
of th eir descendants (jBaugh et al.ll2005l : iSwinbank et al.l 
[20091) . 

In this paper, we report observations of four SMGs, 
drawn frorn the S CUBA Lens Survey (SLS) sample of 
iSmail et al.l ()2002D . with the ultrawide-bandwidth Zpec- 
trometer cross-correlation spectrometer on the 100 m di- 
ameter Robert C. Byrd Green Bank Telescope (GBT). 
For the three of our targets that are brightest at 850 /im, 
the Zpectrometer has successfully detected the lowest 
(J = 1 — 0) rotational transition of carbon monoxide 
(CO), and we undertake a joint analysis with published 
CO J = 3 — 2 detections for this trio of massive SMGs 
at z ^ 2.5. 

Studies of our own and nearby galaxies have estab- 
hshed the CO J = 1 - hue (v.^st = 115.27 GHz) as 
a proxy for tracing the molecular hydrogen (H2) that 
forms giant molecular clouds. CO is the most abun- 
dant molecule after H2 and, unlike H2, has a permanent 
dipole moment that allows it to radiate efficiently. The 
magnitude of the molecule's dipole moment and close 
spacing between the lowest rungs of its rotational ladder 
allow it to be coUisionally excited to trace gas particle 
densities above a few hundred per cubic centimeter and 
temperatures above 5 K. The strong C-0 bond helps 
keep the molecule stable against dissociation by ultra- 
violet light and shocks, so CO probes active as well as 
quiescent regions. Transitions between rotational states 
with increasing rotational quantum number J trace in- 
creasingly warm and dense gas. Rest-frame submillime- 
ter lines trace particle densities of ^ 10*^^ cm^'^ and 
kinetic temperatures of several tens to a few hundred K. 

For all its advantages as a molecular gas tracer, 
observations of the J = 1 — transition at high redshift 
have been hampered by its relatively low observed 
frequency. Observations of the line are rapidly be- 
coming easier as technology improves, as the run of 
publication dates for detections of galaxies at z > 1 
shows (iPa padopoulos et al .l 120011: ICrcve et al.' '2003"; 
Klamer et al. 2005; Hainli ne et al.l 2006; Riccli crs et al.i 
2006t ISw inbank ct al .l 120101: CariUi ct ahl fMoT 
Ivison et al.i i2010i; Araven a et al.l 2010,; .Negrello et al.l 



20101 iFraver et all 12011; Iv ison et al.n 201l[). Peak flux 



density from optically thick and thermalized emission 
scales roughly as frequency squared, so the 1 — lines 
from distant galaxies are expected to be an order of 
magnitude weaker than the 3 — 2 and other mid-J 
lines from the same sources. The 100 meter diameter 
Green Bank Telescope combines an enormous collecting 
area and a low centimeter-wave system temperature to 
approach the necessary sensitivity level. Stability is a 
key consideration for long integrations, and motivated 
our construction of the Zpectrometer, a cross-correlation 
spectrometer optimized for line searches that instan- 
taneously covers the GBT Ka-band receiver's 25.6 to 
36 GHz band. In this band the CO J = 1 - line 
redshifts over a2.2 < z < 3.5 range that includes 



the peak of the iChapman et al.l ()2005D SMC redshift 
distribution. The Zpectrometer and correlation receiver 
architecture have improved stability to the point that 
detection of 1 — emission is relatively straightforward 
for sources with extreme (M > 10) magnifi c ations 
due to gravi tation al lensing (ISwinbank et al.l 120101 : 
INegrello et aIll2Q10l : IFraver et al.l 1201 ID . In this paper 
we discuss observations of weaker CO 1-0 lines from 
galaxies that have considerably less magnification than 
the brightest sources. 

The prime goal of our observations was to measure 
the ratios of 1 — to higher- J line fluxes to explore the 
physical conditions in our targets' molecular gas and to 
test standard assumptions in the use of empirical con- 
version factors to relate their CO luminosities to their 
molecular gas masses. CO J = 1 — spectra of these 
galaxies are especially important because they constrain 
the state of their molecular interstellar media and may 
reveal the presence of massive reservoirs of extended cool 
gas that do not appear in lines from more excited states. 
In addition to tracing cool gas, the 1 — line is essential 
for interpreting mid-J lines to constrain the properties 
of warmer gas. In the local universe, multi-line observa- 
tions of nearby starburst and active galactic nuclei typi- 
cally sh ow low- an d high-excitation gas components (e.g. 
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Guesten et all IT991 iMao et al.l 12000; 
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ponents distinct in low- and high- J lines but contributing 
jointly to mid-J lines. Fluxes in the 1-0 and 2 — 1 lines 
are essential for characterizing the cool component well 
enough to determine the fraction of the mid-J emission 
that comes from each component. 

A secondary outcome of our observations was to test 
the use of the Zpectrometer's large fractional bandwidth 
(A///mcan = 34%) for blind redshift searches toward tar- 
gets identifled in continuum surveys. The Zpectrometer's 
instantaneous redshift coverage is flve to ten times larger 
than those provided by current millimeter-wave interfer- 
ometer band widths, offering the promise of quick CO 
redshift determinations for SMGs without waiting for ra- 
dio continuum mapping or optical spectroscopy. To that 
end, we developed a statistical test for line detection ap- 
propriate for long observations across wide bandwidths. 
The ability to identify sources directly from continuum 
positions known to within the 22" size of the GBT's Ka- 
band beam eliminates some selection effects that bias 
detection towards excited molecular gas. 

This paper has three further sections and an appendix. 
Section [5] describes our observational technique and in- 
strument. Section [3] covers our basic results for each 
source, including an overview of a detection method suit- 
able for wideband spectroscopy. Section [4] contains dis- 
cussion of the physical conditions in the galaxies we de- 
tected, the implications, and a brief summary. An ap- 
pendix describes the details of the detection statistic 
whose use we demonstrate here. 

2. OBSERVATIONS 

The combination of the Zpectrometer cross-correlation 
spectrometer and dual-channel Ka-band correlation re- 
ceiver is specifically intended for wideband observations 
of weak lines. The system's use of correlation makes 
it the single-dish equivalent of a two-element spatial in- 
terferometer (iBlum ,1959; .Harria .2005.) , with a similar 
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promise for high stabihty. The spectrometer has moder- 
ate velocity resolution that is well matched to extragalac- 
tic observations: its spectral response to a monochro- 
matic line is a sinc(a;) function with 20 MHz full width 
at half maximum, corresponding to about 190 km at 
band center. A correlation spectrometer is immune to 
some systematic effects in line detection experiments, as 
the narrow spectral feature corresponding to a line is not 
produced by a single detector, but is created by coherent 
structure across hundreds of independent lags. 

Physically, the Zpectrometer analog lag cross- 
correlator is installed next to the Ka-band receiver on 
the National Radio Astronomy Ob servatory's 100 me - 
ter diameter Green Bank Telescope ([Harris eTalllWil . 
Even in a digital age, analog multiplication still retains 
advantages of low power dissipation and low radiated 
emissions for broadband spectroscopy. Transistor ana- 
log multipliers separated by transmission line delays cas- 
cade to form the Zpectrometer's lag correlators. Sets 
of 256 lags are packaged in four identical independent 
cross-correlator units, each with 3.5 GHz bandwidth. A 
four-channel downconverter splits the receiver's IF band 
to stack the correlators in frequency space. Receiver per- 
formance at high frequency limits the bandwidth, so the 
spectra in this paper cover 10. 5 GHz, the th ree lowest- 
frequency correlator sub-bands. iHarrisI (|2005H covers the 
details of how the combination of a correlation receiver 
front end and analog lag cross-correlator backend differ- 
ences the power between the receiver's two input feed 
horns. This electronic differencing greatly reduces the 
effects of amplifier 1// gain noise, the dominant source 
of instability in good total power radiometers. Layers of 
optical switching (chopping, wobbling, nodding, beam- 
switching, double-beamswitching, etc.; the terminology 
is mixed) and electronic phase switching remove other 
nonideal signals to a very high degree, leaving spectra 
that are very clean compared with a conventional total 
power systems. 

While the correlation architecture greatly improved 
stability, switching the beam position on the sky by mov- 
ing the telescope subreflector (chopping) at a 10 second 
period was still necessary for usable stability. A chop 
throw of 78 arcsec, equal to the angular separation of the 
correlation receiver's two feed horns, alternately placed 
the source on one of the two horns, optically switching 
the source between the receiver's "plus" and "minus" 
sense beams. Differencing the spectra from the two sub- 
reflector positions (tilts) reduced electronic imbalances, 
but the slight difference in telescope illumination in the 
two subreflector positions introduced spectral structure 
from optical imbalance with a peak-peak amplitude of 
approximately 60 mJy. Changing the antenna position 
to view a reference position on the nearby sky and then 
differencing these "source" and "reference" subrefiector- 
switched spectra removed the optical imbalance to a high 
degree. This is the classical double-beamswitching or 
chop-and-nod pattern common to short-wavelength ra- 
dio and infrared astronomy; we used an 8 minute cycle 
time and telescope moves of no more than a few degrees. 
Rather than spending half of the time observing blank 
sky, we took advantage of the system's fixed-tuned op- 
eration and wide bandwidth by switching between two 
sources close in sky positions. This increases observing 



efficiency by a factor of two at the cost of not preserv- 
ing individual source continuum levels, and risking that 
lines from the two sources will fall at the same frequencies 
and cancel to a greater or lesser extent. We felt the gain 
in time offset the risk, and we preserved the data from 
each chop side to recover from this eventuality. Residual 
imbalance in the receiver caused some fluctuating large 
scale structure and an occasionally strong ripple across 
the spectral baseline on timescales short compared with 
the optical switching times. While this structure tended 
to cancel after long integrations, it was still present in 
the spectrum and added nonideal noise. Although the 
noise amplitude in spectra decreased as the square root 
of integration time, excess noise in the receiver was a fac- 
tor of two to four higher than the radiometer equation 
predicts. 

We made gain corrections across the band and estab- 
lished the intensity scale by dividing the source minus 
reference spectra by spectra of the standard radio flux 
calibrators obtained in the same observing sessions. We 
took absolute fluxes of 0.7 7 and 1.9 Jy at 32.0 GH z for 
3C48 and 3C286 from The I Astronomical Almanad . Ob- 
servations of Mars, compared against a physical model 
for Mars' emission (B. Butler, priv. comm.), indepen- 
dently verified the flux scale within a few percent and 
established the quasars' spectral indices across the band. 
We tracked the total power at the Zpectrometer's four- 
channel downconverter to monitor the system tempera- 
ture, but system temperature measurements were com- 
plicated by the presence of nonideal phase noise, which 
does not appear in total power. 

Stepping monochromatic signals with 8 MHz spacings 
across the receiver band calibrated the system phase 
and established the correlator's spectral frequency scale 
([Harris & Zmuidzinas 2001; Harris 2005). Signals from 
one of the GBT's microwave synthesizers were injected 
between one of the feed horns and the receiver's input hy- 
brid for this purpose. To simplify this phase calibration, 
the system local oscillators run at fixed frequencies, so 
the system frequency scale is topocentric. Doppler shift 
corrections to bring the velocity scale to a local stan- 
dard of rest (LSR) scale were applied in data reduction. 
We used version D of the Zpe ctrometer's data reduc tion 
pipeline, written in GBTIDL (jMarganian et al.ll2006l ). to 
make calibrated spectra and quick-look evaluation during 
observatio ns. Further d ata analysis was in R language 
{R Develo pment Core T eam 2006) routines. 

The telescope pointing was generally excellent when 
the wind was low (< 3 m s~^) and for elevations be- 
tween 75°and 20°. We pointed on a compact source near 
the astronomical targets once per hour, finding typical 
corrections of 0.1 arcmin or below, or about a third of 
the 0.35 arcmin beamwidth (0.84 of the peak amplitude 
for a Gaussian beam) at 30 GHz. 

We took spectra of the pointing source hourly to mon- 
itored systematic gain changes, and periodically mon- 
itored receiver gain with a modulated noise source at 
its input. Optical gain changes from pointing and focus 
errors were dominant, and we found that the system- 
atic overall calibration drifted by a maximum of 20% 
over an hour, less at lower frequencies than high. The 
drifts produced a bias that could slightly underpredict 
CO J 1 — line strengths. 
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3. RESULTS 

We observed the sources in pairs to cancel optical off- 
sets, as described in ^ so we summarize the results by 
source pair. Positions, numbers of observing sessions, 
and total integration times ti„t for each pair are listed in 
Table m 

Our data analysis for the spectra in this paper is un- 
tuned: we used an unweighted average of all the data 
from all of the sessions for all sources. Time- varying non- 
ideal noise dominated the spectral structure, so weighting 
by system temperature was not appropriate, and ad hoc 
weighting or editing based on measured session fluctua- 
tions or structure in individual spectra was not justified. 
No baseline structure has been removed other than nar- 
rowband filtering in the Fourier domain to remove an 
approximately 300 MHz period ripple produced in the 
receiver. Offsets from zero flux density in the spectra are 
reproducible but are different for different source pairs: 
they give the continuum level differences between the 
sources in the pairs. There is no sign of discontinuity 
between the three correlator sub-bands, even on strong 
continuum sources, a sign that the correlator's response 
is very linear in power. 

SMM J14011+0252 and SMM J14009+0252— The upper 
panel of Figured] is the full-band difference spectrum be- 
tween this pair of sources, showing detections for both: 
a positive spike corresponds to an emission line from the 
first source of the pair, SMM J1401H-0252 at z - 2.55, 
and a negative-going spike corresponds to emission from 
the second source, SMM J14009-)-0252 at z - 2.95. The 
binning in the top panel of Figure [T] does neither source 
justice because it is a compromise between that best 
suited to the detection of J14011+0252's narrow line and 
J14009-H0252's broad line. 

The lower panel in the figure contains a summary of 
many possible binnings and shows the detections much 
more clearly. We identify lines by exploiting the fact that 
even broad extragalactic lines are narrow on the scale of 
the Zpectronieter's bandwidth, searching for a relatively 
narrow peak across a spectrum whose noise changes with 
frequency. This is a detection confidence plot, a statis- 
tically sound quantitative embodiment of what an expe- 
rienced observer would do by eye: look to see whether 
a spectral channel or set of channels is higher than its 
neighbors, within fluctuations. Estimating noise levels 
by eye is complicated by the changes in noise across the 
spectrum's 25.6 to 36 GHz band; the fluctuation across 
all channels is not a good measure of the fluctuations 
within individual channels. Points in the figure sum- 
marize the results from our detection statistic, which is 
described in the Appendix, over a wide range of binning 
parameters. Each dot in the confidence plot represents 
the result from the detection statistic for one combina- 
tion of bin width and start channel index for the binning. 
In Figure [1] (and Figure 2]), bin widths run from n = 3 to 
10 and starting bins run from to n — 1. The triangles 
mark dots corresponding to the bin width and starting 
bin for the spectrum in the top panel, with one triangle at 
the center of each frequency bin. Columns of dots are fre- 
quencies where the detection statistic is high for a range 
of bin widths and center offsets. Isolated dots are most 
likely to be chance fiuctuations emphasized by a particu- 
lar set of binning parameters. Values on the vertical scale 



are the confidence levels for line detection. This is a very 
nonlinear scale, with units in "number of nines:" i.e., 
levels of 0.9, 0.99, 0.999, etc. Mathematically, the "num- 
ber of nines" is equal to -logio(l ~ P) - logio(^c/ian), 
where P is the probability of detection and the term 
with Nchan corrects the scale to account for the probabil- 
ity of a chance fluctuation given the number of channels 
in the spectrum. Without the channel number correc- 
tion a "3(t" excursion would be unremarkable: if drawn 
from normally-distributed noise, one should appear ev- 
ery 370 samples, on average, and each confldence plot 
contains results from 52 different binnings, each with 50 
to 200 channels, for a total of 9804 points. Regions where 
the columns coherently climb above 2 (0.99 confldence) 
give frequencies where a potential detection is insensitive 
to exact binning, indicating that the channel is reliably 
above or below the mean of its neighbors. 

Vertical dashed lines through Figure [1] show the fre- 
quencies of the 1 — lines measured from fits to the 
full-resolution unbinned 1 — spectra of the two sources 
shown over small frequency ranges in Figures [2] and [3l 
Since the 1-0 and 3 — 2 redshifts agree to high preci- 
sion, these also mark the 3 — 2 redshifts. An advantage of 
observing with a fixed-tuned broadband system is that 
it is possible to make direct comparisons of spectra and 
system performance for all sources. In all of the spectra 
we have taken with the Zpectrometer, there have been 
no signs of spurious detections, and specifically none at 
frequencies corresponding to either of these sources. 

The smooth lines in Figures [5] and [3] represent simple 
Gaussian profile interpretations of the 3 — 2 line parame- 
ters. Tables [2] and [3] give the fit parameters and compare 
them with the 3 — 2 parameters from the literature. At 
the Zpectrometer's maximum frequency resolution, spec- 
tral structure is correlated over about 3 bins; we correct 
for that in linewidth measurements. 

For SMM J1401H-0252, both the Figure and the tab- 
ulated data show that the agreement between the 1-0 
and 3 — 2 redshifts is excellent, and that the linewidths 
are v ery similar (3 — 2 data from [Pow ncs & Solomo3 
120031 which agrees wefl with 'Fraye ret al.l |1999). The 
integrated line intensity in Table [2] is from the Gaussian 
fit parameters to the line, with the 3 — 2/1 — inte- 
grated intensity ratio equal to 6.8. Individual velocity- 
integrated flux densities (in this paper, the "integrated" 
in integrated intensities and brightness temperatures im- 
plies integration over the full linewidth in velocity) are 
computed from the flt parameters for a Gaussian profile, 

/ S,ydv = 1.06 S^^peak ^vfwhm- 

More revealing than the flux density ratio is the in- 
tegrated J = 1 — and 3 — 2 brightness temperature 
ratio, 

_ /T(3-2)d.; ^ /5.(3-2)dz; / ^i_o y 
""^'^ /r(l-0)di; /5.(l-0)dt; V 1^3-2 y '^^ 

where {x — y) is the flux density in the J = x — y tran- 
sition and Vx^y is the frequency of the transition. Depar- 
tures of this ratio from unity reveal the departure from 
thick and thermalized emission from gas in the Rayleigh- 
Jeans limit. For this galaxy R^^ i = 0.76, lower than the 
value of unity expected in this single thick and thermal- 
ized limit. Errors in Table [2] and following tables are at 
the 68.3% (Ict equivalent for a normal distribution) confl- 
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Frequency [GHz] 

Figure 1. Difference spectrum of SMM ,114011+0252 and SMM J14009+0252 (upper panel) and confidence plot (lower panel). The 
triangles in the confidence plot mark the points corresponding to the binning for the spectrum in the top panel. As is apparent from the 
position of the triangles in the confidence plot, the binning for the top spectrum is the compromise necessary to show SMM J14011+0252's 
narrow line and SMM J14009+0252's broader line, so this spectrum shows neither line detection to its best advantage. In the confidence 
plot, the ordinate gives the number of nines (0.9, 0.99, etc.) in the confidence level; see text. 
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Table 1 

Summary of observations. A horizontal line separates the pairs of 

sources that were observed together. The total integration time 
tint is the elapsed observing time for each of the difference spectra, 
Figs. [1] and [1 



Name 


RA (J2000) 


Dec (J2000) 


No. sess. 




SMM J04431+0210 
SMM J04433+0210 


04:43:07.10 
04:43:15.00 


+02:10:25.1 
+02:10:02.0 


9 


9.8 


SMM J14011+0252 
SMM J14009+0252 


14:01:04.96 
14:00:57.68 


+02:52:23.5 
+02:52:48.6 


3 


3.1 




32.20 32.25 32.30 32.35 32.40 32.45 
Frequency [GHz] 



Figure 2. Spectrum of SMM J14011+0252 with Gaussian pro- 
file fit (curve) and Gaussian equivalent of t he 3 — 2 s pectral fit 
with flux density divided by nine (dotted, IDownes~fc Solomon 
[2003). 




28.8 29.0 29.2 29.4 29.6 29.8 

Frequency [GHz] 



Figure 3. Spectrum of SMM J14009+0252 with Gaussian pro- 
file fit (curve) and Gaussian equivalent o f the 3 — 2 spectral fit 
with flux density divided by nine f dotted. IWeifi eiral.li2009ai') . 



Table 2 

Summary of results and comparison to 3 — 2 data from lDownes & SolomonI I I2003I ) 
for SMM J14011+0252. 



Parameter CO (1 - 0) CO (3 - 2) 

LSR Redshift 2.5652 ± 0.0002 2.5652 ± 0.0001 

Peak flux density S„ [mjy] 1.85 ± 0.20 13.2 ± 1 

Line width [km s^^] < 208 190 ± 11 

Integrated line intensity [Jy km s^^] ~ 0.4 ± 0.05 2.8 ± 0.3 

Integrated intensity ratio, 7(3 - 2)//(l - 0) 6.6 ±1.0 

Integrated brightness temperature ratio, Rs^i 0.76 ± 0.12 



Table 3 

Summary of results and comparison to 3 - 2 data from lWeifi et al.l H2009al ) for 
SMM J14009+0252. Structure in the spectral baseline rendered the line fits too 

poor for believable error estimates, so we quote the best fit without errors for 
line width and peak intensity. Integrated intensity ratios and errors are derived 
from a bootstrap analysis; see text. 



Parameter CO (1 - 0) CO (3 - 2) 

LSR Redshift 2.9346 2.93450 ± 0.00035 

Peak flux density [mJy] 0.65 5.4 ± 0.9 

Line width [km s-l] 643 470 + 60 

Integrated line intensity [Jy km s^^] 0.44 2.7 ± 0.3 

Integrated intensity ratio, 7(3 - 2)//(l - 0) 6.1 ±0.7 

Integrated brightness temperature ratio, R:j^i 0.67 ± 0.08 
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dence level for the fits, as determined by the R language's 
conf int routine. 

SMM J14009+0252's line is clearly detected when the 
binning concentrates most of the flux into one channel 
(Fig. [T]), but nonideal structure in the baseline hinders 
precise line amplitude and width measurements from 
full-resolution spectra. Formal error estimates are con- 
sequently not realistic. Judging from the structure in 
Fig. [3l the ripple in the spectrum causes the fit to un- 
derestimate the true 1 — intensity and overestimate the 
linewidth; an integrated intensity estimate from the fit 
parameters is questionable. Instead, we derive the in- 
tegrated intensity and errors from a bootstrap analysis 
of the numerator of equation (jA2[) for the binning that 
yields the highest signal-to-noise detection, multiplying 
by a factor to account for flux outside the rectangular 
bin for a G aussian lineshape. With 3 — 2 data from 
iWeifi et all (pOOQa ). the 3-2/1-0 intensity ratio is 
6.1, or i?3,i = 0.67. 

SMM J04431+0210 and SMM J04433+0210— Figure H 
indicates that the only detection in this pair is of 
SMM J04431+0210; none of the negative-going struc- 
ture is detected with high significance. The spectrum 
in the upper panel is binned for the best signal to noise 
for this line. The vertical dashed line is the measured 
1-0 line frequency from a fit to the full-resolution 1-0 
spectrum (Fig. [5]). Since agreement between the 1 — 
and 3 — 2 redshifts are excellent, it could equally well 
indicate the 3 — 2 redshift. Figure [5] is a full-resolution 
spectrum, with the solid curve a single-component Gaus- 
sian profile fit. T he dotted line re presents the 3 — 2 line's 
triangular shape (jNeri et al.ll2003D , scaled to the 1-0 fre- 
quency, with the amplitude divided by nine. Lineshape 
differences between the two transitions are not large and 
could be due to noise in either spectrum or to physical 
substructure with somewhat different excitation condi- 
tions. Table 0] makes a numeric al comparison between 
the 1-0 and 3 — 2 (jNeri et al.ii2003 ) line parameters, 
with the 1-0 parameters taken from the Gaus sian profile 
fit. Taking the 3 — 2 integrated intensity from lNeri et al.l 
([2003), the 3 — 2/1-0 flux-density intensity ratio is 5.5 
and i?3 1 = 0.61. 

The nondetection of SMM J04433-h0210 could be due 
either to a redshift outside the Zpectrometer's coverage 
(2.2 < z < 3.5 for CO 1 - 0, 5.4 < z < 8.0 for CO 
2 — 1) or to a low CO flux. The source has no known red- 
shift, largely because its faint optical co unterpart is cha l- 
lenging for spectroscopy {K = 22 A: ,Fraver et al.ll200l ). 
although the existence of a radio counterpart suggests 
that it does not lie at a much higher r edshift than the 
z ~ 2 .5 radio-preselected SMGs of the iChapman et al.l 
([2005h sample. Given that SMM J04433+0210's 850 fim 
flux density of only 4.5 ± 1.5 r nJy is barely abov e the de- 
tection threshold of the SLS (jSmail et al.1l2002D . within 
the range where noise fluctuations can boost a weak de- 
tection significantly, we view low CO flux as the most 
likely reason for its nondetection. 

4. DISCUSSION 

We detected CO J = 1 — emission from the three 
sources in our samples that have known CO 3 — 2 line 
parameters. The similarities of line redshifts and velocity 
widths for the 1-0 and 3 — 2 lines justify a joint analysis 



of the two. For the sample given in Section [51 the mean 
and standard deviation of the means is i?3_i = 0.68±0.08. 
Compared with other SMGs, the mean ratio from our 
sample is slightly h igher than the rat io R3 i = 0.48 for 
SMM J02399-0136 ('Iv ison et al.l[20Tl . the brightest of 
the SLS galaxies in 850 /xm continuum, and is compara- 
ble to the ratio fo r the highly lensed SMG SMM J2135- 
0102 i?3,i = 0.68 (jSwinbank et al.ll2010l:lDanielson et all 
l2010r ). For this sample of five SMGs with currently 
known 1—0 and 3—2 fluxes, the ratio is R^^i = 0.64±0.10. 

A sample of five is not large, but neither is it so 
small that a few additional observations can dramati- 
cally change the result. For example, if the ratio for 
most SMGs is really R^^i = 1 and we have been un- 
lucky in our choice of galaxies, it will take observations 
of another thirteen galaxies with unity ratio to reach a 
sample mean ratio of i?3.i = 0.9. Roughly speaking, ob- 
servations of fifteen additional galaxies will be needed to 
reduce the dispersion by a factor of two. We can also es- 
timate the chances that we have drawn five of five galax- 
ies with R31 below the true mean. From the binomial 
theorem, the probability is 0.03, for a symmetrical dis- 
tribution where each draw has a probability 0.5 of being 
above or below the mean. These simple considerations 
indicate that our conclusion that i?3^i ~ 0.6 in SMGs is 
robust. A recent indep endent data set b ased on EVLA 
data gives i?3,i « 0.55 H vison "eran[20Tll) . in agreement 
with our value within errors. 

The values of mean and dispersion in R^^i carry two 
linked implications: the first one for mass estimates that 
rely on line luminosity scaling, and the second for the 
state of the typical interstellar medium in SMGs. 

Observers have had to use incomplete data along with 
assumptions and approximations to make mass estimates 
for distant galaxies. In mass derivations from millimeter 
wave spectroscopy of SMGs, which arc most often ob- 
served in the mid- J lines, a frequent implicit assumption 
is that the CO lines share the same excitation tempera- 
ture Tex from the observed mid- J transition down to the 
1-0 line at the base of the CO rotational ladder. An 
alternative to the assumption of constant T^x has been 
to fit observed line fluxes to a single-cornpone nt ISM ra- 
diative transfer model (e.g. iWeiss et al.l [20071) and then 
use the model predictions for unobserved line fluxes. The 
CO 1 — data we report probe the ground-state emission 
of the CO molecule to provide critical tests for these as- 
sumptions, which of necessity invoke simple interstellar 
media. 

The excitation temperature, T^x, is a very general con- 
cept that describes an energy density, whether kinetic, 
radiative, rotational, vibrational, spin, etc. In observa- 
tional molecular spectroscopy Tg^ is the measured quan- 
tity; for the rotational transitions of the CO molecule, 
the excitation temperature is the rotational temperature 
Trot- In this paper we use the term thermalized in its 
rigorous sense, to mean that Trot is equal for all tran- 
sitions of interest. Thermalized in this sense does not 
necessarily imply that the molecule is in local thermody- 
namic equilibrium (LTE), with rotational temperature 
equal to the kinetic temperature of the surrounding II2 
molecules, although this is often the case for low- J CO 
transitions and is a common use of the term. In a rigor- 
ous context the term subthermal indicates that the ex- 
citation temperature Trot for a given transition is below 
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Figure 4. Difference spectrum of SMM J04431+0210 and SMM J04433+0210 (upper panel) and confidence plot (lower panel). The 
triangles in the confidence plot mark the points corresponding to the binning for the spectrum in the top panel. See Fig. [l]s caption for an 
explanation of the confidence scale. 



Table 4 ^ 

Summary of results and comparison to 3 — 2 data from lNeri et al.l 120031) for 
SMM J04431+0210. 



Parameter CO (1 - 0) CO (3 - 2) 

LSR Redshift 2.5086 ± 0.0007 2.5094 ± 0.0002 

Peak flux density, [mjy] 0.58 ± 0.08 3.5 

Line width [km s-l] 415 ± 62 350 ± 60 

Integrated line intensity [Jy km s"!] 0.26 ± 0.05 1.4 ± 0.2 
Integrated intensity ratio, 7(3 - 2)//(l - 0) 5.5 ±1.4 
Integrated brightness temperature ratio, Rz^\ 0.61 ± 0.15 
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Figure 5. Spectrum of SMM J04431+0210 with Gaussian profile 
fit (curve) and triangular approximation to the 3 spectral line- 
shape with flux density divided by nine (dotted, INeri et al.ii2003l ) 




T[K1 

Figure 6. Values of Planck function ratios for optically thick 
and thermalized gas for J < 4. The ratios reach the Rayleigh- 
Jeans limit, in which the radiation and kinetic temperatures are 
approximately equal, when the frequency u of the higher frequency 
line is hv/k > T. 

Trot of a comparison transition from the same ensem- 
ble of molecules, without reference to T^im T^ad, or any 
other external bath. 

Strictly speaking, the usual practice of extrapolating a 
constant radiation temperature T^ad measured in a mid- 
J line for lower- J transitions is incorrect even if the den- 
sity is high enough that Trot = Tkm- the kinetic temper- 
ature in SMGs is unlikely to be high enough to drive Trad 
into the asymptotic Rayleigh- Jeans limit. As Figure |6] il- 
lustrates, the Planck radiation (brightness) temperature 
ratios for even low- J lines still climb toward unity for 
T < 100 K, so cx Trad 7^ Tki r,,. At a typical S MG 
dust temperature of about 40 K (|Blain eEal][2Q02) the 
correction from the Rayleigh- Jeans limit is a factor 1.2 
for the 4 — 3 line, with corrections increasing with J. If 
uncorrected, errors from the Rayleigh- Jeans approxima- 
tion add artificial roUoff to plots of CO line intensity vs. 
J when those plots involve scaling intensity by frequency 
squared to find the collapse of excitation with J. 

Returning to the observations, the small dispersion in 
^3,1 justifies an empirical scaling prescription between 
the two intensities for many sources, although there will 
doubtless be excursions far from the mean for individ- 
ual sources where scaling fails. In the local universe, an 
integrated brightness temperature of the CO 1 — line, 
Ico — /?i_odw, has been used extensively to estimate 
total molecular gas column densities. The 1-0 tran- 
sition traces molecular gas with a wide range of excita- 
tion and is relatively easy to observe in nearby sources. 
Although optically thick, it can trace mass by "cloud- 
counting" many individual clumps at different velocities 



within a telescope's beam (e.g. JDickman et al.lll986t ). In- 
tegrating Ico over projected area defines a luminosity 
L'nr\ that is directly p roportional to molecular gas mass; 
iSolomon et al.l ()1992[ ) derive a form appropriate at cos- 



mological distances as L'f- 



co 



Ico ^.-^ D\ , where il.. 



IS 



the source solid angle and angular diameter distance Da 
accounts for cosmology. Ico is related to the II2 column 
density iV(Il2) through the X factor, and L'qq to gas 
mass through the a factor. X and a differ by a constant 
factor and have been calibrated for the 1- line in Galac- 
tic, st arburst, and ULIRG environments; iTacconi et al.l 
l|2008( ) provide a brief critical review of applying these 
scale factors in the high-redshift universe. 

In the framework of scaling luminosity between differ- 
ent lines, the mean value of i?3_i — 0.64 we find shows 
that a scaling factor of about 1/0.64 = 1.5 is likely to be 
a more accurate predictor of the 1-0 integrated intensity 
from the 3 — 2 integrated intensity than a factor of unity. 
To first order, this will increase gas masses deduced from 
assuming ^coi-o ~ ^'c03-2 by a factor 1.5. 

Departures from a ratio of unity are not limited to 
the 3-2/1-0 pair. Finding values other than unity 
from actual 1 — observation s of SMGs is becoming 
commonplace. iHainline et al.l ([2006) and iCarilli et al.l 
([20 10.) found similar behavior in their 1 — observations 
of SMM J13120-H4242 and GN20, with an equivalent ra- 
tio i?4,i of 0.26 (with some uncertainty from low-level 
emission in line wings) and 0.45, respectively. The de- 
viations from a simple thick and thermalized model are 
in the same direction as we find, but are larger and the 
scatter in the ratio is much higher. 

Many of the earlier explanations for the lack of equal- 
ity in line brightness temperatures invoked subthermal 
(in the rigorous sense we discussed) excitation of the 
mid-J lines in such a simple, single-component ISM as 
an explanation for decreasing T^x with J. We do not 
agree with this approach because it is an unphysical dis- 
traction rather than a useful approximation. Multi-line 
observations of nearby galaxies (and, for that matter, 
Galactic giant molecular clouds) show interstellar me- 
dia with multiple components as defined by lineshape, 
emission from different molecular species, or detailed 
physical conditions. If the 1-0 and 3 — 2 emission 
in the SMGs we observe were from a single compo- 
nent of subthermally excited gas, the 3 — 2 intensity 
would be very sensitive to the detailed physical condi- 
tions. We modified an escape-pro bability radiative trans- 
fer co de (J. Stutzki, priv. comm . : I Stutzki k, WimiewisseH 
119851 : cross-sections from lFlower fc Launavlll985f ) to ex- 
plore conditions matching a conservatively broad range 
of i?3_i = 0.6 ± 0.2. To ensure our model produced rea- 
sonable emissivities, we eliminated solutions with optical 
depths less than unity in both lines. The formal solutions 
are in Figure [71 Solutions with densities much above 
10'^ cm^'^ are not likely to be of physical importance. 
First, the high density solutions occur at low tempera- 
tures, where the lines are extremely weak since they are 
close to the T = 9.6 K background temperature at the 
model's z = 2.5. Such lines will not dominate the molec- 
ular gas luminosity. Further, it is questionable whether 
the bulk of high density gas could be much colder than 
the bulk of the dust, which has a temperature of about 
40 K in SMGs (|Blain et al.ll200l . Overall, the model 
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Figure 7. Side views of the three-dimensional volume containing 
radiative transfer calculation results to identify the physical condi- 
tions that produce Rs^i = 0.6 ±0.2 by subthermal excitation. The 
plot is a summary of 27k models filling the volume with equal spac- 
ing as seen in the projected views. The points represent successful 
models, with size corresponding to the ratio: the largest diameter 
points are near the observed ratio of R3,i = 0.6 and the smallest 
S't ^^3,1 = 0.4 and 0.8. Results for optically thin lines have been 
removed from this summary to satisfy observational constraints. 
High-density, low-temperature results are not likely to be physical; 
see text. 

indicates that subthermally excited gas would have a 
density of a few x 10^ cm~'^ at a CO column density of 

5 X 10^* cm~^ in a 400 km s^^ linewidth, with lower 
densities possible at higher columns. 

Explanations for line ratios that require very specific 
physical conditions (or a very constant gravitational lens 
amplification ratio for the two lines) are not supported 
by the low dispersion of R31 from even our handful of 
SMGs. It is unrealistic to expect such a tight range of 
parameters from sample of galaxies selected without re- 
gard for their CO emission properties, especially so when 
observations of other lines directly contradict the special 
conditions. Interstellar media dominated by low-density 
material are incompatible wit h detections of CO J = T — 

6 from SMM J14011+0252 pownes fc SolomonI [2001 
and J = 5 - 4 from SMM J14009-I-0252 (|V\^yieLalJ 
[2009a): these lines are strong and rule out CO excita- 
tion that is collapsing by J = 3. The presence of ex- 
cited gas in these sources does not seem to be unusual: 
a com pendium of CO excitation diagrams in I Weiss et al.l 
(|2007D shows that the brightest CO lines are in states J 
of 5 to 6 for typical SMGs. 

Brightness temperature ratios Rz,i < 1 are not pe- 
culiar to SMGs , but are common in most g alaxies. 
lYao et all (|2003[ ) and iMauersberger et al.l ()1999l i report 
mean values of i?3^i = 0.66 and i?3,i = 0.63, respec- 
tively, in diffe rent sampl e s of lu minous galaxies in the lo- 
cal universe, lono et all (pOO^ find a mean i?3,i = 0.48 
for dusty U/LIRG s at a n altogether different redshift, 
and lAravenaet al.l (|2010l) find i?3,i = 0.61 for a z = 1.5 
BzK galaxy sel ected by its rest-U V/optical colors. At 
the same time, iBavet et al.l (|2006t ) find that CO ladder 
excitation peaks at J ~ 4 for the Galaxy and I C 342, and 
at J ~ 7 for starbursts, a type common in the lYao et al.l 



(|2003D and IMauersberger et all ()1999D samples. The ex- 
citation rolloff seems to shift to much highe r J for very 
luminous AGN hosts: iVan der Werf et al.l ()2010D find 
J = 13 - 12 is still st rong in the ULIRG Mrk 231, and 
Bradford et~aL[ ()2009[ ) find strong lines to J = 9 in the 
Cloverleaf QSO. 

A pattern of apparently subthermal line ratios from an 
ISM that also has strong emission from lines at higher 
J is a clear signature of a multi-component ISM. In any 
given line from an ISM with a range of physical condi- 
tions, the warmest optically thick regions with the largest 
area filling factors will dominate the emission. Basic ex- 
citation considerations suggest strong 1 — emission from 
distributed gas, while the 3 — 2 emission is dominated by 
star-forming and other regions with enhanced excitation. 
High spatial resolution obs ervations of SMGs do show 
varying size with excitation. Daniel son et al.l (j2010) find 
systematic lineshape an d size changes with J in CO emis- 
sion toward one source. iBothwell et al.l ()2010[ ) shows that 
mid- J CO emission regions' sizes are different than the 
sizes of the star formation as traced by radio continuum. 
Ivison (jivison et al.ll20lil ) make a comparison of emission 
sizes in the CO 1-0 and mid- J transitions. Evidence 
for higher excita tion in the centers of local U/LIRGs 
(jlono et al.l I2OO90 , which numerical m odels imply may 
also characterize high-redshift SMGs ([Narayanan et al] 
120091) ■ suggests that the balance between components is 
partly set on galaxy-wide scales. Naravanan ct al. (200!|) 
run hydrodynamic simulations of galaxy mergers that 
yield SMC luminosities, finding that the resulting ISM 
produces galaxy-averaged CO emission with line ratios 
that mimic subthermal excitation from a single compo- 
nent ISM. While this qualitatively matches the data, the 
observed 3-2/1-0 fiux ratio of 5.8 ± 0.9 is higher 
than the model results of 3 ± 1, indicating that the gas 
is somewhat more excited than predicted. Bringing the 
model results into better agreement with observations 
may be useful in refining modeling prescriptions. Over- 
all, given the wide range of galaxies that show a similar 
i?3^i and lack of correl ation with most physical properties 
that ' Yao et al.l (2003") find for i?3,i, it is plausible that 
the different lines trace different mixtures of regions in an 
ISM with a hierarchical (e.g. self-similar, fractal) geome- 
try, as is commonly found in high resolution observations 
of our Galaxy and emerges from numerical simulations 
of cloud structure. 

At this stage we do not have enough lines to identify 
ISM structures or to separate components through radia- 
tive transfer calculations, but we can draw some simple 
conclusions from basic considerations. The brightnesses 
of the lines and correlation between 1-0 and 3—2 intensi- 
ties indicate that the individual lines are optically thick. 
Deviations from R^ i ~ 1 then come from different geo- 
metrical filling factors of related components in a multi- 
component ISM, with a small contribution from failure 
to reach the asymptotic Rayleigh- Jeans limit (Fig. [H]). 

Finding a multi-component ISM in SMGs has impli- 
cations for the choice of relationship between CO 1 — 
intensity and mass, M = aL'fjQ. The value used most 
often for SMGs i s a = 0.8 (M(7) km s~^ P*^^)"^ (r eviewed 
with caveats in iSolomon fc Vanden Bond I2005D , which 
was derived for ULIRGs from radiative transfer calcula- 
tions based on an interpretation ofa2 — 1/1 — integrated 
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brightness temperature ratio below one as evidence of 
subth ermal excitation in ULIRGs (jPownes &: Solomonl 
119981) . An ISM similar to other galaxies favors values 
for a closer to calibrations from the dense parts of st ar- 
burst nuclei or the Galaxy (see iTacconi et alJ 120081 for 
a summary and references to individual studies). Com- 
pounding the factor of (i?3,i) = 1-5, an increase in 
a by a factor of about 2 would increase mass estimates 
for the SMGs by an overall factor of about 3. Increased 
mass estimates also imply decreased star formation ef- 
ficiency estimates. Large masses derived from scaling 
factors will run into upper limits set by dynamical mass 
measurements, and may indicate that the local (Galac- 
tic) [C0]/[H2] abundance ratio is lower than those in 
high-redshift SMGs. 

In summary, we find observationally that a luminosity 
ratio of L'^q{3 — 2) « 0.6 x L'^q{1 — 0) is more appropri- 
ate than the customary assumption of equality. Scaling 
the 3 — 2 integrated brightness temperatures to 1 — 
does seem to be empirically justified, but a factor of 
0.64~^ = 1.5 is more appropriate than unity. Increas- 
ing the scaling factors for both line integrated intensities 
and the conversion between line luminosity and mass, as 
suggested by the similarity of i?3_i in SMGs and local lu- 
minous galaxies, would increase mass estimates from ob- 
served 3 — 2 lines for SMGs further, unless conversion fac- 
tors are allowed to change with environment. Increasing 
the scaling factor would increase simple molecular mass 
estimates for SMGs and decrease their derived star for- 
mation efhciencies. It appears that simple line flux scal- 
ing breaks down beyond J = 3, as although the trend in 



the line ratios are similar, the dispersion in temperature 
ratios becomes substantially larger, probably because the 
higher- J lines probe past the average peak excitation. 
The large number of galaxies with similar i?3.i ratios, 
SMGs included, indicates that single-component models 
are inadequate descriptions of what must be more com- 
plex interstellar media. Identifying galaxies with R^ i 
considerably different from the typical value of 0.6 will 
be valuable in understanding the origin of the typical 
conditions in the ISM. 
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APPENDIX 

APPENDIX: LINE DETECTION CONFIDENCE METHOD 

Our detection confidence calculation is based on determining whether a channel has a systematically different ampli- 
tude from its neighbors. Setting a detection criterion for the line amplitude in a single channel would be simple if we 
somehow knew the mean value with no line present and had accurate knowledge of the system fluctuation within the 
channel. However, typical broadband spectra have offsets and other large-scale structures in the spectral baseline, and 
the noise may change with frequency as the receiver or system temperatures change. Estimating noise parameters by 
calculating, for instance, the standard deviation across frequency channels will be misleading in such cases. Comparing 
the mean value of a channel with those of its neighbors is more fruitful, as neighbors are likely to share offsets and noise. 
Limiting the number of neighbors increases the uncertainty of noise estimates in the spectral domain, however. The 
time sequence of data within each channel, sampled by the many subscans in a long integration, provides information 
on the fluctuations in each channel. 

Our detection calculation is based on comparing mean values between neighboring frequency channels, using noise 
estimates derived from the time sequences of individual channels to calculate the statistical significance of departures 
from the mean, with detection significance framed in a classical hypothesis test. Rather than testing for a detection 
of a line with unknown amplitude, we find the probability that the null case of no detection fails. If a spectral line 
is not present in the parent data, then the mean value in some channel X will be equal to the mean value of its M 
neighbors Yi: 



within fluctuations from noise. 

To estimate the fluctuation in each channel, we calculate sample variances Sj^ and Syi in the time sequence of each 
channel, derived from the N 4-minute subscans in our final spectra. This approach provides estimates for individual 
channels, independent of systematic structure across the spectrum. 
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Combining the difference in means and channel amphtude uncertainties, a suitable test statistic is: 




(A2) 



This is a useful form because it has a Student-t distribution with (M +1){N —1) degrees of freedom in the case that the 
signals in all channels X and Yi are from the same normally distributed parent population. While this condition cannot 
be strictly true in general, it can be a quick and reasonable approximation for channels close together in frequency 
and over times short compared with atmospheric changes; i-tests tend to be robust. As values of d become far from 
zero, the value produced by a nondetection, it becomes less likely that the true value is zero within fluctuations, and 
more likely that a real deviation from a mean value of zero has been detected. For an emission line search, where 
even a large negative excursion counts as a nondetection, the probability that the measured weighted difference d is 
consistent with zero is the one-tailed test: 



In searching for positive and negative excursions, as we would with our difference spectra, a two-tailed test is appro- 
priate. 

In practice, absolute probabilities given by equation (jA3[) are approximate because the assumptions of stationary 
statistics with equal variances for all neighbors are not strictly correct. In addition, this analytical approach has 
no way to accommodate weighting (e.g. for changes of atmospheric tran smission with time) durin g data analysis. 
To counter these shortcomings, we turned to the bootstrap technique (see lEfron &: Tibshiraniiri994[ ). creating boot- 
strapped spectra by randomly drawing the the same number of subscans as in the initial data set, with replacement 
(duplicates are allowed), from the pool of subscans. The bootstrapped spectra can then be suitably weighted averages 
of individual subscans. For the examples in this paper, we made 2000 bootstrapped spectra for each source pair, or 
2000 examples of spectra we might have observed assuming the po ol of subscans is representative of all data (this is 
the fundamental principle behind the bootstrap) . We then used Eq. (jA2p as the "studentized" detection statistic, since 
eliminating scale gives pivot al (varian ce stabilized) forms with superior properties in bootstrap confidence calculations 
(jEfron fc T ibshirani 1994; Zo ubir fc Iskandcr 2004). The resulting distribution of d should be very closely normal, with 
the central limit theorem acting on sums of the nearly-normal statistic d. We verified the assumption by comparing 
the distributions of d and the normal distribution on quartile-quartile plots, finding that the probabilities determined 
by counting bootstrap results and by the normal approximation were the same within fluctuations from finite sample 
length. 

Counting bootstrapped results that satisfy a condition and then normalizing by the number of bootstrap samples 
gives distribution-independent probability estimates, with the probability range bounded by the number of samples. 
Samples of 2000 are suitable for finding distribution-independent probabilities to about 0.99, for instance. To increase 
the confidence limit range we could have either increased the number of bootstrap samples or used the empirical 
agreement of the studentized d to the normal distribution. We chose the latter as a conservative approximation: the 
observed bootstrap result distribution should be, and empirically is, closely normal; because a normal approximation is 
insensitive to a few extreme results; and because of practical limits on computational time. Even with the calculation 
of d in a compiled code function to speed iterative calculations within the R language framework we use, typical run 
times for a full range of binning were about a minute, so increasing the number of samples by a few orders of magnitude 
from 2000 was impractical for interactive analysis. 
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